A Data Quality Metamodel Extension to CWM

نویسندگان

  • Pedro Gomes
  • José Farinha
  • Maria José Trigueiros
چکیده

The importance of metadata has been broadly referred in the last years, mainly in the field of data warehousing and decision support systems. Contemporarily, in the adjacent field of data quality, several approaches and tools have been set out for the purpose of data profiling and cleaning. However, little effort has been made in order to formally specify metrics and techniques for data quality in a structured way. As a matter of fact, little relevance has been assigned to metadata regarding data quality and data cleaning issues. This paper aims at filling this gap, proposing a conceptual metamodel for data quality and cleaning, both applicable to operational and data warehousing contexts. The presented metadata model is integrated with OMG’s CWM, offering a possible extension of this standard toward data quality.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Semantic Approach towards CWM-based ETL Processes

Nowadays, on the basis of a common standard for metadata representation and interchange mechanism in data warehouse environments, Common Warehouse Metamodel (CWM) – based ETL processes still has to face significant challenges in semantically and systematically integrating heterogeneous sources to data warehouse. In this context, we focus on proposing an ontology-based ETL framework for covering...

متن کامل

A Standard for Representing Multidimensional Properties: The Common Warehouse Metamodel (CWM)

Data warehouses, multidimensional databases, and OLAP tools are based on the multidimensional (MD) modeling. Lately, several approaches have been proposed to easily capture main MD properties at the conceptual level. These conceptual MD models, together with a precise management of metadata, are the core of any related tool implementation. However, the broad diversity of MD models and managemen...

متن کامل

Integration and Reuse of Heterogeneous Information: Hetero-Homogeneous Data Warehouse Modeling in the Common Warehouse Metamodel

The corporate data warehouse integrates data from various operational data stores of a company. These operational data stores may be heterogeneous with respect to the represented information. The hetero-homogeneous data warehouse modeling approach overcomes issues associated with the integration of heterogeneous information from the operational data stores by featuring a generally homogeneous s...

متن کامل

Das Common Warehouse Metamodel als Referenzmodell für Metadaten im Data Warehouse und dessen Erweiterung im SAP Business Information Warehouse

Heterogene Data Warehouse-Landschaften sind durch eine Vielzahl verschiedener Softwarekomponenten gekennzeichnet, deren Integration zu einer funktionierenden Business Intelligence-Lösung eine besondere Herausforderung darstellt. Die Metadaten der beteiligten Komponenten stellen dabei einen viel versprechenden Ansatz der effektiven und effizienten Verknüpfung dar, die aber durch die proprietären...

متن کامل

Um Metamodelo para a Especificação de Data Warehouses Geográficos

The decision-making processses can be supported by many tools such as DW (Data Warehouse), OLAP (On-Line Analytical Processing) and GIS (Geographical Information System). Much research found in literature is aimed at integrating these technologies. However, the metamodeling of spatial and dimensional schemas for GDW (Geographical DW) is still an open question. In this context, this paper propos...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007